Robust connected word speech recognition using weighted Viterbi algorithm and context-dependent temporal constraints

Authors

Néstor Becerra Yoma

Lee Luan Ling

Sandra Dotto Stump

Abstract

This paper addresses the problem of connected-word speech recognition with signals corrupted by additive and convolutional noise. Context-dependent temporal constraints are proposed, compared with ordinary temporal restrictions, and used in combination with the weighted Viterbi algorithm, which had been tested in isolated-word recognition experiments in previous papers. Connected-word recognition tests show that the weighted Viterbi algorithm depends on the accuracy of the state duration modelling, and that the approach covered here can lead to reductions as high as 90 or 95% in the error rate at moderate SNR using spectral subtraction, an easily implemented technique, even with a poor noise estimate and without using any information about the speaker. It is also shown that the weighting procedure can reduce the error rate when cepstral mean normalization is also used to cancel both additive and convolutional noise.
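To make the idea of weighting in Viterbi decoding concrete, the following is a minimal sketch and not the authors' implementation. It assumes that each frame carries a reliability weight in [0, 1] and that the weight scales the emission log-likelihood, which corresponds to raising the emission probability to that power. The function name `weighted_viterbi`, its arguments, and this exact form of the weighting are assumptions made for illustration. The abstract does not specify how the weights are computed, and the temporal constraints it describes are not modelled here.

```python
import numpy as np

def weighted_viterbi(log_pi, log_A, log_B, weights):
    """Viterbi decoding with a per-frame weight on the emission term.

    log_pi:  (N,)   initial state log-probabilities
    log_A:   (N, N) transition log-probabilities, log_A[i, j] = log P(j | i)
    log_B:   (T, N) emission log-likelihoods, log_B[t, j] = log b_j(o_t)
    weights: (T,)   assumed frame reliabilities in [0, 1]
    """
    T, N = log_B.shape
    # Assumption: weighting is w_t * log b_j(o_t), i.e. b_j(o_t) ** w_t.
    scaled = np.asarray(weights)[:, None] * log_B
    delta = log_pi + scaled[0]
    backptr = np.zeros((T, N), dtype=int)
    for t in range(1, T):
        # scores[i, j]: best score ending in state i at t-1, then moving to j
        scores = delta[:, None] + log_A
        backptr[t] = np.argmax(scores, axis=0)
        delta = scores[backptr[t], np.arange(N)] + scaled[t]
    # Trace the best path backwards from the best final state.
    path = [int(np.argmax(delta))]
    for t in range(T - 1, 0, -1):
        path.append(int(backptr[t, path[-1]]))
    return path[::-1], float(np.max(delta))

# Toy example: the middle frame is "noisy" and misleadingly favours state 1.
log_pi = np.log(np.array([0.5, 0.5]))
log_A = np.log(np.array([[0.9, 0.1], [0.1, 0.9]]))
log_B = np.log(np.array([[0.95, 0.05], [0.001, 0.999], [0.95, 0.05]]))

path_plain, _ = weighted_viterbi(log_pi, log_A, log_B, np.ones(3))
path_weighted, _ = weighted_viterbi(log_pi, log_A, log_B, np.array([1.0, 0.0, 1.0]))
```

In this toy case, giving the unreliable middle frame a weight of zero lets the transition model and the neighbouring frames decide the path, so the decoder stays in state 0 instead of jumping to state 1 for a single frame.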


Similar resources

Temporal constraints in Viterbi alignment for speech recognition in noise

This paper addresses the problem of temporal constraints in the Viterbi algorithm using conditional transition probabilities. The results here presented suggest that in a speaker dependent small vocabulary task the statistical modelling of state durations is not relevant if the max and min state duration restrictions are imposed, and that truncated probability densities give better results than...


Context-dependent word duration modelling for robust speech recognition

Conventional hidden Markov models (HMMs) have weak duration constraints. This may cause the decoder to produce word matches with unrealistic durations in noisy situations. This paper describes techniques for modelling context-dependent word duration cues and incorporating them directly in a multi-stack decoding algorithm. The proposed model is capable of penalising duration constraints of a wor...


Weighted Viterbi algorithm and state duration modelling for speech recognition in noise

A weighted Viterbi algorithm (HMM) is proposed and applied in combination with spectral subtraction and Cepstral Mean Normalization to cancel both additive and convolutional noises in speech recognition. The weighted Viterbi approach is compared and used in combination with state duration modelling. The results presented in this paper show that a proper weight on the information provided by sta...


Robust speech recognition based on Viterbi Bayesian predictive classification

In this paper, we investigate a new Bayesian predictive classification (BPC) approach to realize robust speech recognition when there exist mismatches between training and test conditions but no accurate knowledge of the mismatch mechanism is available. A specific approximate BPC algorithm called Viterbi BPC (VBPC) is proposed for both isolated word and continuous speech recognition. The proposed...


Applying word duration constraints by using unrolled HMMs

Conventional HMMs have weak duration constraints. In noisy conditions, the mismatch between corrupted speech signals and models trained on clean speech may cause the decoder to produce word matches with unrealistic durations. This paper presents a simple way to incorporate word duration constraints by unrolling HMMs to form a lattice where word duration probabilities can be applied directly to ...



Publication date: 1999
